NatServer: A Client-Server Architecture for building Parallel Corpora applications
نویسندگان
چکیده
Parallel corpora are important resources for most Natural Language processing tasks. From the common applications, like machine translation, to the usually mono-lingual tasks as paraphrase detection and word sense disambiguation, most researchers are using massive parallel corpora. Thus, the availability of an efficient way to manage them is very important. This paper presents a ClientServer architecture to query efficiently parallel corpora and probabilistic translation dictionaries.
منابع مشابه
An Open Architecture for the Construction and Administration of Corpora
The use of language corpora for a variety of purposes has increased significantly in recent years. General corpora are now available for many languages, but research often requires more specialized corpora. The rapid development of the World Wide Web has greatly improved access to data in electronic form, but research has tended to focus on corpus annotation, rather than on corpus building tool...
متن کاملCollaboratively Building Language Resources while Localising the Web
In this paper, we propose the collaborative construction of language resources (translation memories) using a novel browser extension-based client-server architecture that allows translation (or ‘localisation’) of web content capturing and aligning source and target content produced by the ‘power of the crowd’. The architectural approach chosen enables collaborative, in-context, and realtime lo...
متن کاملWEA, a Distributed Object Manager Based on a Workspace Hierarchy
WEA is our implementation of a new architectural model for virtual memory access, the WorkSpace. It relies on a generalisation of client / server model and enables to build new distributed applications. The workspace supplies uniform access to a distributed persistent object store. This paper describes several ways of building multi-workspace architecture, and an implementation of this architec...
متن کاملA Novel Method for VANET Improvement using Cloud Computing
In this paper, we present a novel algorithm for VANET using cloud computing. We accomplish processing, routing and traffic control in a centralized and parallel way by adding one or more server to the network. Each car or node is considered a Client, in such a manner that routing, traffic control, getting information from client and data processing and storing are performed by one or more serve...
متن کاملA Client/Server Architecture for Word Sense Disambiguation
This paper presents a robust client/server implementation of a word sense disambiguator for English. This system associates a word with its meaning in a given context using dictionaries as tagged corpora in order to extract semantic disambiguation rules. Semantic rules are used as input of a semantic application program which encodes a linguistic strategy in order to select the best disambiguat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Procesamiento del Lenguaje Natural
دوره 37 شماره
صفحات -
تاریخ انتشار 2006